Detecting Signals in Noisy Data - Can Ensemble Classifiers Help Identify Adverse Drug Reaction in Tweets?
نویسندگان
چکیده
In this paper, we describe our system for detecting adverse reactions from tweets, a task organized as part of Pacific Symposium of Biocomputing Social Media Mining Shared Task. The shared task primarily involves a binary classification of the tweets, whether they contain description of adverse drug reaction or not. We propose an ensemble machine learning classifier to tackle the unbalanced distribution of the classes in the data provided for the task. A feature set containing unigrams, bigrams, and trigrams (a selected list, using mutual information), co-occurrence of drug and side effect, negation, sentiment score, etc. were used to train the classifiers in identifying the tweets with the description of adverse drug reactions. Our system obtained 0.4195 F-score and ranked first in the shared task.
منابع مشابه
Combining Classifier Guided by Semi-Supervision
The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...
متن کاملA Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows
One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...
متن کاملCombining Classifier Guided by Semi-Supervision
The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...
متن کاملطراحی و روش نمونهگیری مطالعه آگاهی، نگرش و عملکرد خانوارها و کارکنان بهداشتی در خصوص تغذیه و ریزمغذیها در استانهای پایلوت برنامه
Background and Objectives:To compare three different methods of signal detection applied to the Adverse Drug Reactions registered in the Iranian Pharmacovigilance database from 1998 to 2005. Materials and Methods:All Adverse Drug Reactions (ADRs) reported to Iranian Pharmacovigilance Center from March 1998 through January 2005, were included in the analysis. The data were analyzed based on thr...
متن کاملمقایسه روشهای اپیدمیولوژیک در شناسایی سیگنالهای عوارض دارویی ایران
Background and Objectives:To compare three different methods of signal detection applied to the Adverse Drug Reactions registered in the Iranian Pharmacovigilance database from 1998 to 2005. Materials and Methods:All Adverse Drug Reactions (ADRs) reported to Iranian Pharmacovigilance Center from March 1998 through January 2005, were included in the analysis. The data were analyzed based on thre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015